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Abstract 

The accuracy of the Arthurs-Kelly model of a simultaneous measure- 
ment of position and momentum is analysed using concepts developed 
by Braginsky and Khalili in the context of measurements of a single 
quantum observable. A distinction is made between the errors of 
retrodiction and prediction. It is shown that the distribution of mea- 
sured values coincides with the initial state Husimi function when 
the retrodictive accuracy is maximised (this being the result first ob- 
tained by Busch, using somewhat different methods). It is further 
shown that the distribution of measured values is related to the final 
state anti-Husimi function (the P representation of quantum optics) 
when the predictive accuracy is maximised. The disturbance of the 
system by the measurement is also discussed. A class of minimally 
disturbing measurements is characterised. It is shown that the distri- 
bution of measured values then coincides with one of the smoothed 
Wigner functions described by Cartwright. 
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1. Introduction 

It has been known since the publication of Hcisenberg's original paper [1] that 
quantum mechanics does not allow both the position and the momentum of a 
system to be measured with arbitrary accuracy. However, it does not follow from 
this that one cannot measure both quantities with a less than perfect degree of 
accuracy. Indeed, it would seem that it is essential that quantum mechanics does 
permit such measurements, if it is to be possible to derive classical mechanics from 
quantum mechanics as an approximate theory, valid in some appropriate limit. For 
this and other reasons simultaneous measurements have been the subject of much 
theoretical interest over the years. 

In recent years interest in them has been greatly increased, due to technical ad- 
vances in the field of quantum optics. As a result of these advances simultaneous, 
imperfect measurements of the position and momentum of a quantum mechanical 
system are no longer confined to the idealised world of gedanken experiments. They 
can actually be realised in the laboratory [2, 3, 4, 5, 6, 7, 8, 9]. For a recent review 
of these methods, and of the closely related subject of the tomographic reconstruc- 
tion of the quantum state, the reader may consult Leonhardt and Paul [10], or 
Leonhardt [11]. 

In the following we shall be concerned with the problem of describing the accu- 
racy of such measurements, and the disturbance they produce in the system whose 
position and momentum are being measured. In a previous paper [12] we developed 
some general methods for analysing this problem. Our purpose now is to illustrate 
the use of these methods, by applying them to the example of the Arthurs-Kelly 
process [2, 3, 4, 9, 11]. 

The accuracy of, and disturbance produced by a simultaneous measurement 
process has been the subject of many previous investigations. One important rela- 
tionship is the inequality first derived by Arthurs and Kelly [2, 6, 11, 13, 14] 

A/ixf A/ip f > h (1) 

where A/ixf and A/ipf denote the uncertainties in the final pointer positions, after 
the measurement has been completed. This inequality is to be compared with the 
ordinary uncertainty principle 

AxAp>^ (2) 

The doubling of the lower bound in Eq. (1) as compared with Eq. (2) is due to the 
additional noise introduced by the measurement process. 

We discussed the Arthurs-Kelly inequality in ref. [12]. We will here content our- 
selves with remarking, that although the product A/i X f A^ Pf provides a numerical 
indication of the overall accuracy of the measurement, the quantities A^ X f and 
A^ipf arc not themselves directly interpretable as experimental errors. One would 
like to find quantities which can be individually and immediately identified with 
the errors in the measured values of position and momentum. 

The problem of defining quantities which do provide a direct characterisation 
of the errors in a simultaneous measurement of position and momentum has been 
systematically addressed within the framework of the approach based on the con- 
cepts of a positive operator valued measure (or POVM) and an unsharp observ- 
able [15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28]. For a recent review, and 
additional references, the reader may consult the book by Busch et al [9] . We give 
a brief discussion of the way in which the measurement errors are defined in this 
approach in the Appendix. 

Our own interest in this problem stems from the work of Uffink [29] , who has 
criticised the POVM approach, and more generally the very concept of a joint 
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measurement of non-commuting observables, in terms which may fairly be described 
as unequivocal. He states 

. . . the formalism of quantum theory, as it is presented by von Neu- 
mann, simply has no room for a description of a joint measurement of 
position and momentum at all. [p. 200] 

He further asserts 

the claim that within this formalism [i.e. the formalism based on on 
the concept of a POVM] a joint unsharp measurement of position and 
momentum or a pair of spin components is possible is false. 

He goes on to assert, that claims that the POVM approach does provide a theory 
of joint measurements, 

rest on the adoption of inappropriate definitions, i.e., definitions that 
trivialize the problem. 

In connection with these statements, we should say at the outset, that we do not 
feel that Uffink's arguments are sufficiently strong to support his conclusion. On 
a purely intuitive level, it seems evident that the processes discussed in refs. [2, 3, 
4, 5, 6, 7, 8, 9, 10, 11] must be describable as joint measurements in some sense of 
the word. It seems no less evident, that it cannot really be correct to dismiss the 
claims of the POVM approach in the way that Ufiink does dismiss them, wholesale, 
as "false" and/or "trivial". In short, it appears to us that Uffink overstates his 
case. However, that is not to say that his criticisms are without foundation. On 
the contrary, we consider that Uffink has raised some very important and interesting 
questions of principle which it would be worthwhile trying to clarify. This was the 
original motive for the work reported in ref. [12]. 

If one examines the statements just quoted it will be perceived that Uffink ac- 
tually makes two different claims, which may be enumerated as follows: 

1. The conventional, generally accepted formulation of quantum mechanics, as 
it is presented in the book by Von Neumann [30] , has no room for the concept 
of a simultaneous measurement of position and momentum. Or, to put it 
another way, the conventional theory of quantum mechanics does not permit 
both the position and momentum of a system to be jointly determined, not 
even with a less than perfect degree of accuracy. 

2. The attempt to solve the joint measurement problem within the framework 
of the approach based on POVM's and unsharp observables fails, because it 
depends on definitions which are "inappropriate" because they "trivialize" 
the problem. 

In ref. [12] we addressed the first of these objections (we intend to address the 
second objection in a future publication). We took it, that a measurement consists 
of a process in which a correlation is established between one set of observables 
(the quantities being measured) and another set of observables (the meter read- 
ings). In the case of interest to us, the observables being measured are the position 
x and momentum p of a quantum mechanical system having one degree of free- 
dom. The observables representing the measurement outcome are two commuting 
pointer observables fix and fip. A process which brings about, simultaneously, both 
a correlation between x and fix, and a correlation between p and fip, is ipso facto 
describable as a simultaneous measurement of x and p. The problem then reduces 
to the problem of giving a precise characterisation of the degree of this correlation. 
In other words, it reduces to the problem of giving a precise characterisation of the 
measurement accuracy. In ref. [12] we showed how this problem can be solved by 
taking some concepts developed by Braginsky and Khalili [31] in the context of sin- 
gle measurements of position only, and applying them to the case of simultaneous 



3 



measurements of position and momentum together. At no stage did we go beyond 
the conventional theory of quantum mechanics, as it is presented by Von Neumann. 
We were therefore able to conclude, that contrary to what Uffink asserts, the con- 
ventional theory does permit the simultaneous, albeit less than perfectly accurate, 
measurement of both the position and the momentum of a quantum mechanical 
system. 

Our motive for undertaking the research reported in ref. [12] was the desire to 
clarify a fundamental question of principle. However, in the course of this research 
it became clear to us that the methods we had developed had an interest which 
went somewhat beyond the purpose for which we had originally intended them. It 
appears to us that they provide one with a way of analysing the physics of simul- 
taneous measurement processes which is useful, both as a mathematical technique, 
and as a source of physical insight. 

We discussed the fundamental concepts on which our approach in based in 
ref. [12] (also see Section 2 below). Our approach rests on making a careful dis- 
tinction between the retrodictive and predictive (or determinative and preparative) 
aspects of a joint measurement process. Corresponding to this distinction we intro- 
duce two different kinds of error: a pair of retrodictive errors and a pair of predictive 
errors. We also define a pair of disturbances, providing a quantitative indication of 
the disturbance of the system by the measurement process. We then derive a total 
of six inequalities relating these quantities: viz a retrodictive error relationship, a 
predictive error relationship, and four error-disturbance relationships. As we stated 
above, our approach represents a development of the analysis given by Braginsky 
and Khalili [31] for the case of single measurements of position only. 

In the case of joint measurement processes Ali and Prugovecki [18] give a defi- 
nition which is essentially equivalent to our definition of the errors of retrodiction 
for the class of processes to which their definition is applicable. In particular, our 
definition is essentially equivalent to theirs in the case of the Arthurs-Kelly process, 
which forms the subject of this paper. However, it can be shown (see the Appendix) 
that our definition is the more general of the two, in that it applies to a class of 
measurement processes which is strictly larger than the class considered by Ali and 
Prugovecki. Also, our definition provides additional physical insight even in those 
cases where it is equivalent to — gives the same numerical values as — the definition 
of Prugovecki and Ali. We used this fact in ref. [12], when trying to answer Uffink's 
objection, that the conventional theory of quantum mechanics, as formulated by 
von Neumann, has no room for the concept of a simultaneous measurement of 
non-commuting quantities. 

However, the most important feature of our approach is the way in which we 
analyse the distinction between the retrodictive and predictive aspects of a joint 
measurement process, and the way in which we relate these two aspects to the 
disturbance produced by the measurement. The distinction between retrodiction 
and prediction has been discussed by several other authors, as has the reciprocity 
between the accuracy of a joint measurement, and the disturbance produced [9, 15, 
22, 23, 24, 26, 32, 33]. However, it appears to us that our method of analysis is a 
significant source of additional insight. It also has certain technical advantages, in 
that it provides a numerical characterisation of the various contrasting features of 
a joint measurement process which is particularly concise, and convenient. 

In the following sections we will illustrate these points, with an analysis of the 
Arthurs-Kelly process. This process has been the subject of several previous inves- 
tigations [2, 3, 4, 8, 9, 11, 23]. We derive a number of new results. However, in some 
cases we only give an alternative derivation of a result which is already known. We 
thought that this was worth doing since our primary purpose is to illustrate the use 
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of some general methods, which have an application to many other measurement 
processes. 

We begin, in Section 2 by giving a brief summary of the relevant results from 
rcf. [12]. In section 3, we show how the initial apparatus state may conveniently be 
expanded in terms of eigenstates of the retrodictive and predictive error operators. 
We then use this fact to show how the distribution of measured values depends on 
the distribution of retrodictive errors; and how the final state of the system depends 
on the distribution of predictive errors. In particular, by appropriately choosing the 
distribution of retrodictive errors, it is possible to ensure that the distribution of 
measured values is given by any member of the class of "operational distributions" 
which are obtained by taking the convolution of two Wigner functions, as discussed 
by Davies [16], Wodkiewicz [34] and others [9, 28, 35, 36]. With the appropriate 
choice for the distribution of predictive errors it is possible to prepare the system 
in any desired pure state. 

In section 4 we consider retrodictively optimal processes: i.e. processes which 
minimise the product of retrodictive errors. We show that in every such case the 
distribution of results is given by the initial system state Husimi function [37, 38] 
(or Q representation). This is the result first obtained by Ali and Prugovecki [18] 
and, in the context of the Arthurs-Kelly process, by Busch [23]. 

In section 5 we consider predictively optimal processes, which minimise the prod- 
uct of predictive errors. We show that the distribution of measured values is then 
related to the final system state anti-Husimi function [38, 39] (or P representation). 

The distributions of retrodictive and predictive errors are independent of one 
another: it is possible for a measurement to be optimal retrodictively, whilst being 
very poor predictively, or vice versa. In section 6 we consider completely optimal 
processes: i.e. processes which are both retrodictively and predictively optimal, 
and which also minimise the degree of disturbance. 

Finally, in section 7, we consider the disturbance of the system by the measure- 
ment. It is possible to reduce the level of disturbance below that produced by an 
optimal measurement, provided one is willing to accept a reduced degree of accu- 
racy. In section 7 we consider measurements which give the maximum degree of 
accuracy consistent with a given level of disturbance. 

Before concluding these introductory remarks we should, perhaps, observe that 
the contrast we drew above, between the "POVM approach," and the approach 
adopted in the present paper, is potentially misleading. It is, in fact, almost impos- 
sible to talk about simultaneous measurement processes without tacitly introducing 
the concept of a POVM. When we distinguish our own approach from the POVM 
approach we do not mean to suggest that POVM's are not implicitly present in our 
analysis. We only mean that the concept does not explicitly play the same central 
role that it does in, for example, the book by Busch et al [9]. 

In the following pages we make no direct use of results obtained within the 
context of the POVM approach. That is because the work reported here is a 
continuation of the work reported in ref. [12]. The original motive for this work 
was our desire to answer the first of Uffink's objections (as itemised above). In 
order to do so we considered it to be necessary to go back to first principles, and 
to think things through from the foundation up. We were thereby led to a different 
approach to the theory of simultaneous measurements of position and momentum, 
which it seems most natural to present in the manner in which we discovered it, 
independently of the POVM approach. 

It would obviously be desirable to relate the approach taken in this paper to 
the POVM approach. However, that is a task which we prefer to leave to a future 
publication, when we also hope to address the second of Uffink's objections (as 
itemised above). 
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2. Definition of the Errors and Disturbances 

In the process described by Arthurs and Kelly [2] a system, with position x and 
momentum p, interacts with an apparatus characterised by two pointer observables 
Ax (measuring the value of x) and Ap (measuring the value of p). Let 7Tx and ttp 
be the momenta conjugate to Ax and Ap respectively. Then 

[x,p] = [Ax,7rx] = [Api^p] = it 1 

all other commutators between the operators x, p, fix, 7Tp, fip, tcp being zero. 
The measurement interaction is described by the unitary evolution operator 

U = cxp [- 1 (n P p + n x x)] (3) 

We assume that system+apparatus are initially in the product state \tp ® <f> ap ), \ip) 
being the initial state of the system, and \4>& v ) the initial state of the apparatus. 

In order to define the errors and disturbances we switch to the Heisenberg picture. 
Let be any of the Schrodinger picture operators x, p, fix, ttp, fip, 7Tp- Let 0; = 
be the value of the corresponding Heisenberg picture operator immediately before 
the interaction. Let Of = U'OU be its value immediately afterwards. Then 

Xf = X + TTp pf — p — 7Tx 

MXf = fiX + X + ±7T P 7T X f = 7TX (4) 

MPf = AP + P - I^X TTXf = 7TX 

Define the operators exi, epi, £xf, tpf, ^x, $p by 

exi = Axf - xi e Pi = Apf ~ Pi 

exf = Axf - Xf epi = Apf - Pf (5) 

Sx = x f -Xi S P = pf - pi 

Following Braginsky and Khalili [31] we then define the rms errors of retrodiction 

i i 

A ci X = ((V> ® ap | £ X i |^ ® <£ap)) 2 A ci p = (("0 ® </> ap | e|i |"0 ® 0ap)) ' 

(6) 

and the rms disturbances 

i i 
A d x = ((-(/> ®</> a p|<$ x |V'®0ap)) 2 A d p = ((V> ® ap | 5 P |V> ® ap )) 2 

A e ja;, A e ij» correspond to the quantities Ax moasuro , Ap moasuro defined by Braginsky 
and Khalili in the context of single measurements of x or p only. They provide 
a numerical indication of the accuracy with which the result of the measurement 
reflects the initial state of the system. A d x, A d p correspond to Braginsky and 
Khalili's Ax pcrtU rb, Ap pcrtU rb- They provide a numerical indication of the degree 
to which the measurement perturbs the state of the system. 
We also define the rms errors of prediction 

A cf x = ((^®<?!>ap|exf |'0®0a P )) 2 A ef p= ((V>®0a P |epf IV'®^)) 2 



These quantities provide a numerical indication of the accuracy with which the 
result of the measurement reflects the final state of the system. Braginsky and 
Khalili do not consider this second kind of error. 

We have given a detailed discussion of the interpretation of the quantities A e iX, 
A ei p, A ef x, A cf p, A d x, A d p in rcf. [12]. 
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In view of Eqs. (4) and (5) we have 

exi = Ax + |ttp epi = Ap — ^x 

exf = Ax - ^Trp epf = Ap + i^x (7) 



£x = ttp <5p = -ttx 



Consequently 



[eXn £Pi] 


= — ih 


[exf, epf] 


= ih 


[exi,<5p] 


= —ih 


[exf,5p] 


= —ih 


[<5x,£pil 


= —ih 


r<*x,epfl 


= —ih 



all other commutators between the operators exi, epi, exf, epf, ^x, ^p being zero. 
We deduce the retrodictive and predictive error relationships 

A ci x A oi p > I (8) 

A cf xA of p>^ (9) 
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and the four error-disturbance relationships 



. . h hj 

A ei x A d p > - A ci x A d p > - 

ft ft < 10 > 

A ci pA d x > - A cf pA d x > - 

Eqs. (8) and (9) jointly comprise a precise, quantitative statement of the well-known 
principle, that the product of the errors in a simultaneous measurement of position 
and momentum must be greater than a number <~ ft. This principle is logically 
distinct from the uncertainty principle usually so-called [13, 14, 33, 40]. 

In ref. [12] we have shown that Eqs. (8-10) hold for many other simultaneous 
measurement processes, apart from the Arthurs-Kelly process. 

The distinction between the two different aspects of a quantum mechanical mea- 
surement process — the retrodictive or determinative aspect versus the predictive or 
preparative one — has been discussed by numerous authors, as has the unavoidable 
perturbation of the system by the measurement [9, 15, 23, 24, 26, 32, 33]. The 
quantities A c jX, A jp, A e fX, A e fp, A d x, A d p, and the inequalities relating them, 
provide a a convenient numerical characterisation of these features. However, it 
ought to be stressed that the characterisation is not complete. It is, for instance, 
clearly impossible to give an exhaustive description of the change in the state of the 
system by only specifying two numbers. The values of the six quantities defined in 
this section encapsulate some important properties of the measurement process. It 
is not to be supposed that they encapsulate every significant property 

The method of describing statistical quantities in terms of variances, and mean 
square values — the method adopted here, in other words — is subject to certain 
limitations. In recent years there has accordingly been some interest in devising al- 
ternative approaches. One approach is that involving parameter-based uncertainty 
relationships [33, 41]. Another approach is that involving entropic uncertainty re- 
lationships [24, 25, 26, 42]. It would be interesting to see if either or both of these 
approaches could be used to develop the formalism of this paper. 

It will be observed, that the retrodictive and predictive rms errors enter Eqs. (8- 
10) in a completely symmetric manner. As we will see the two kinds of error 
can also be fixed independently of one another (in the case of the Arthurs-Kelly 
process). With the appropriate choice of initial apparatus state it is possible to 
arrange for the retrodictive errors to be small while the predictive errors are large; 
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or for the predictive errors to be small while the rctrodictive errors are large; or for 
the retrodictive and predictive errors both to be small. In particular, there exists 
an initial apparatus state for which A e jX = A e fX, A c ip = A e fp and for which the 
products A e jxA e ip and A e fxA e fp both take the minimum value of |. We will refer 
to such measurements as completely optimal. They are discussed in Section 6. 

It should, however, be stressed that this symmetry between the retrodictive and 
predictive aspects of a simultaneous measurement process disappears when one 
turns to the physical interpretation of the quantities A c jX, A c jp, A e fX, A c fp, as 
discussed in Section 5 of rcf. [12]. One should also bear in mind the fact just 
mentioned, that these quantities only afford a partial characterisation of the mea- 
surement process. When we refer to them as the rms errors of retrodiction and 
prediction we mean, strictly and precisely, no more, and no less than what is stated 
in Section 5 of ref. [12]. 

The subject of retrodiction and prediction, and the relationship between them, 
raises some deep conceptual questions which have been the subject of discussion 
ever since the formulation of the modern theory of quantum mechanics in the mid- 
1920's [43]. It appears to us that the methods developed in ref. [12], and in the 
present paper, might be used to gain some additional insight into these questions. 
However, there is clearly a great deal more work which needs to be done. 

3. The Distribution of Measured Values 

We see from Eq. (7) that the error and disturbance operators only depend on 
the apparatus observables fix, ^x, fip, ^p- It follows that the rms errors and 
disturbances are independent of the initial system state \xp). We also see that the 
operators exi, exf constitute a complete commuting set for the apparatus state 
space, with conjugate momenta — epi, epf. It will be convenient to work in terms of 
simultaneous eigenkets of exi and exf, which we denote |exi)Cxf)g x . g xf - They are 
related to the simultaneous eigenkets of fix and 7rp, denoted |mx,7Tp)^ x # p , by 

kxi,exf), - = o (exi + £ xf) , (exi — e xi)) - - 

We now use this equation, Eq. (3) and the Baker-Campbcll-Hausdorff identity to 
deduce 

x,Ax,# P ( a; '^ x ' 7rp I^V ,0< ^p) 

= x.Ax^'^'H exp (~27^ p * x ) CX P ex P (-H^xx) |V ® ^ap) 

= & (x - 7T P |V) 4xij£ - xf ((^X + 7T P - X) , (fix - X) |0ap) 

Taking Fourier transforms we get 

J dx' exp [|^ P (x - x')] g Xi) g Xf ((Mx - x 1 ) , (fj, x - x) |^ ap ) ,(x' |V>) 

(11) 

We can now calculate p(/j,x,^p), the probability density function describing the 
result of the measurement: 

p(a*x,Mp) 

2 

. p (x,/i X ,Mp| U \4>® ap ) 

J dx'dx" exp [y P (x" - x')] (/ix - x'| p ei | M x - x") (x 1 1 V) (V> I x") 

(12) 
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where p e; is the reduced initial apparatus state density matrix corresponding to the 
pair e X i, -e Pi : 

<£xi| K |e Xi ) = / de X i tXi ,e xl ( e ^ Cxf \^ ^ I 6 * 1 ' e xf )g xl ,« Xf 

Let H4y,i be the Wigner function describing the initial state of the system, and let 
W €l be the Wigner function corresponding to p ei : 

W ayii (x,p) = i J dy exp (|-py) (x - \y \ ip) (ip \ x + \y) 

W Ci (exi, epi) = J dy exp (-|e Pi t/) (e X i - /5 £i |e X i + 5J/) 

Then the distribution of measured values can be written 

p(px,pp) = J dxdpW ei (p x ~ x,p P -p) W sy ,i{x,p) (13) 

With a suitable choice for the distribution of retrodictive errors, it is possible to 
obtain any member of the class of operational phase space distributions, discussed 
by Davies [16], Wodkiewicz [34] and others [9, 28, 35, 36]. 

The fact, that the Arthurs-Kelly process can be used to obtain any member 
of this class of distributions is shown in Leonhardt [11] (also see Ban [28]). The 
novelty of the derivation just given consists in the fact that we have shown that 
the distribution of measured values depends only on the distribution of retrodictive 
errors, and is independent of the distribution of predictive errors. 

In certain cases the convolution in Eq. (13) can be inverted [24, 35, 44]. This 
means, that the original state can be reconstructed from the measured probability 
distribution provided that the latter is known with perfect accuracy — a fact which is 
sometimes expressed by saying that the measurement is informationally complete [9, 
24]. However, it should be observed that the fact is of less practical usefulness than 
may initially appear due to the amplification of statistical errors which occurs when 
one tries actually to carry out the inversion using real experimental data [45] . 

The right hand side of Eq. (13) only depends on the distribution of retrodictive 
errors. If, on the other hand, one wants to relate p (px, Pp) to the final state of the 
system, then one needs to consider the distribution of predictive errors. We confine 
ourselves to the case when the initial apparatus state factorises: 

(exi, exf I 0a P ) = (exi I <?U (exf I <?U (14) 

Suppose that the pointer positions px, pp are recorded to be in a region D?. Let 
psy.f be the reduced density matrix representing the state of the system immediately 
after the measurement. Then 

(a; I /3 sy ,f la;') = — / dp x dpp (x,fi X ,pp \ U\ip® </>a P )(V' ® <f>a. P \ & W ,Px,Pp) 
V-R J-r 

where p^ is the probability of finding ^x and [i? in the region ft: 

px= dpxdpp p (px, Pp) 

J 3? 

Using Eqs. (11), (12) and (14) we obtain 
(a; I p sy , f \x') 

= — [ dfixdpp exp [i/xp (x - x')} P (^ x ,Pp) , (fix - x \<f> £! ) (<f> £t \p x - x') 

(15) 
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Let W sy ,{ be the Wigner function describing the final state of the system, and let 
W e! be the Wigner function corresponding to the state \<t> ei ): 

W syi f(x,p) = i J dy exp (j-py) (x - \y\ p sy j \x + \y) 

W e! (e X f , epf) = ^ J dy exp (|e Pf y) (e X f - |y | £f ) (<^ f | e X f + \y) 
Then Eq. (15) becomes 

W sy .i(x,p) = — [ dpxdpp W ei (fix - x,p P -p)p(px,pp) (16) 
Ph. J% 

Eq. (13) shows how the distribution of retrodictive errors can be used to express p 
in terms of W sy .i- Eq. (16) shows how the distribution of predictive errors can be 
used to express W S y,f in terms of p. 

If 31 is a sufficiently small region surrounding the point (px, Pp) 

W sy .f{x,p) w W ef (p x - x, pp -p) 

We see, that with a suitable choice for the distribution of predictive errors, the 
Arthurs-Kelly process can be used to prepare the system in any desired pure state. 

4. Retrodictively Optimal Measurements 



Suppose that the measurement maximises the degree of retrodictive accuracy: 
Define the quantity Aj by 



A ei a:A ci p = ^ (17) 



A ^ A ^ 

A ci a; = —= A ei p = 



V2 ^ 
We will refer to Ai as the retrodictive spatial resolution. 

The necessary and sufficient condition for Eq. (17) to be true is that the initial 
apparatus wave function be of the form 



(exi, exf | 0a P ) = ( ZT2 ) ex P ( ~^ e xi ) & ( £ xf) (18) 



1 



1 \ 3 ( 1 

y, 

where fa (exf) is arL arbitrary normalised function. fa determines the errors of 
prediction. The fact that it is arbitrary means, that requiring the measurement to 
be retrodictively optimal places no constraint on the predictive accuracy. The two 
kinds of error are completely independent. 

In the 7rx, /tp-representation Eq. (18) takes the form 



A 2 

if IV 
(MP - 2 n X) 



(pp + |tt x ) 



2h 2 

where fa is the Fourier transform of fa: 

fa (epf) = \[\ j de X f exp ^--^epfexf^ fa (exf) 

We recognise the wave function considered by Stenholm [4] . 

Using Eq. (13) we find, for the probability distribution of measured values, 

p(^x,pp) = Qi,A, (Mx,Mp) 
where is the initial system state Husimi function [37, 38]: 



?: a (Mx,Mp) ^ / '/•'■'/y exp 



^2 (/1 X -X) 2 - ^ (MP -P) 2 
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The fact that the Husimi function gives the distribution of measured values for the 
case of maximal accuracy was first shown by Ali and Prugovecki [18], working in 
terms of the approach based on POVM's and unsharp observables. The speciali- 
sation of this result to the case of the Arthurs-Kelly process was first discussed by 
Busch [23]. 

Finally, let us calculate the disturbances in this case. We have 

\2 



/A- 
dexidexf (e X i - £xf) 2 |(exi, exf | 4> ap )\ 2 = y + (A cf x) 

(A d p) 2 = J de Pi de Pf (e Pi - e Pf ) 2 |(e Pi , e Pf | </> ap )| 2 = + ( A efP) 



(19) 



Using the predictive error relationship, Eq. (9), we deduce 

A^x Adp > h 

5. Predictively Optimal Measurements 



We next consider measurements which maximise the predictive accuracy: 
Define the quantity Af by 



A cf x A of p = | (20) 



A 

A cf X = — = A cf p : 



V2 CI " V2\ { 

We will refer to Af as the predictive spatial resolution. The necessary and sufficient 
condition for Eq. (20) to be true is that (exi, exf | </>a P ) be of the form 



IV f 1 2 



(exi, exf | <M = ( ^ exp -—^ e xf ) <f>i (e X i) (21) 



Suppose that the final pointer positions are recorded to be in the region Ji. In view 
of Eq. (15) the final system state reduced density matrix is given by 

P sy ,{ = —[ dnxdfipp(nx,fJ.p) |(/i X ,Mp)A f ) ((a i x,Mp)a, | (22) 
where \(px, Pp)\ { ) is the coherent state with wave function 



(x|(^ x ,Mp) Af ) = ( ^2 ) CX P 



1 , ,2 . i 



(x - ^ x ' 2 

A f 

Let Pf iAf be the anti-Husimi function describing the final state of the system (the 
P-representation of Glauber and Sudarshan). We have [38, 39] 

P S y,f = J dfj, x d(J,p P{,x f G"x,Mp) |Ox,Aip) Af ) ((Mx,Mp) Af I (23) 

Comparing Eqs. (22) and (23) we deduce 

p , v /^-p(Mx,Mp) if(Mx,Mp)e^ 

Pf,\i {Px,Pp) = < ™ 

I otherwise 

If 31 is a sufficiently small region surrounding the point (fix,Pp), tnen the system 
is approximately in the state |(/x x ,Mp)a ( ) after the measurement: 

PsyJ- |(MX,Mp) Af )((MX,Mp)A f l 

The reader may easily verify that 

AdX Adp > h 
as in the case of a retrodictively optimal process. 
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6. Completely Optimal Measurements 

Suppose that the measurement is both retrodictively optimal at spatial resolution 
A;, and predictively optimal at spatial resolution Af . In view of Eqs. (18) and (21) 
the initial apparatus wave function must be 

(exi, exf | 0a P ) = (vrAiAf)" 5 exp (-^2 e !i - ^2 e xf 



We have from Eq. (19) 



- - IK 2 '^ f' /; 



The necessary and sufficient condition for this expression to achieve its lower bound 
is that the retrodiction and prediction both be at the same spatial resolution: 
A; = Af = A, say. We then have, in the /&x,Ap representation 



2 ( 1 2 A 2 1; 



(Mx, MP I 0a P ) = cxp \-^x - ^2 Mp 

which is the wave function considered by Arthurs and Kelly [2] . With this choice of 
\4> ap ) the process produces the least amount of disturbance consistent with maximal 
accuracy. It might therefore be described as a completely optimal process. 

It is interesting to note, however, that Ai and Af are completely independent. 
One could, for instance, have A; — > and Af — > oo — so that the measurement 
gives an almost perfectly accurate retrodiction of position, whilst at the same time 
preparing the system in an approximate eigenstate of momentum. Of course, the 
disturbances would then be very large. 



7. Minimally Disturbing Measurements 

It is possible to make A^xA^p smaller than h, provided that one is willing to 
accept some loss of accuracy. In this section we address the question: what is 
the greatest accuracy which can be achieved for a given level of disturbance? We 
confine ourselves to the case when the rctrodictivc and predictive errors are equal: 

A ci x = A cf x and A ci p = A ei p (24) 

We assume that the product of disturbances is given by 

A d a; A dP = he' 71 (25) 

for some rj > 0. We then ask: what is the least value of the product A e {X A e ip = 
A c fX A c fp subject to this constraint? And: what is the probability distribution of 
measured values when the lower bound is achieved? 
It is convenient to define 

ex = \ (exi + exf ) and e P = \ (e Pi + e Pf ) (26) 

We also have 

$x = exi - exf and S P = i Pi - e Pf (27) 

Consequently 

[ex, Sp] = [Sx, ep] = -ih 

all other commutators between ex, <5x, e P , Sp being zero. We see that ex, Sx consti- 
tute a complete commuting set of apparatus observables, with conjugate momenta 
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-<5p, — ep. In particular 



(0ap £x 0ap) <</ 


ap | 




K) > 


ft 2 

4 






(0ap *x l^ap) (</ 


>a P | 


^P | 


K) > 


ft 2 
4 






It follows from Eqs. (26) and (27) 














(A ci x) 2 = (</> ap 6 X ap ) + -( 


^ap 




0ap) 4 


(</>ap 


ex<Jx 


<^ap) 


(A cf x) 2 = <0 ap e x </> ap ) + i( 


^ap 


\si 


0ap) - 


(<£ap 


ex<Jx 


<^ap) 



(28) 



In view of Eq. (24) we must have 



ex<5x |0a P ) =0 



and 



Similarly 



(A ci 2;) 2 = (A f.x) 2 = (<^ap| e x \<f>a, P ) + ^(0ap| <5 X |0ap) 



(A oi p) 2 = (A of p) 2 = (0 ap | £p \<j) ap ) + ^< 

In view of Eq. (25) we can choose A such that 

A d x = A cxp (-2) 

A d p = j exp(-2) 

In view of Eq. (28) we must then have 



<^ ap | e x |0 ap ) > —e 11 

h 2 

(0ap|e P |0ap) > 

Inserting these results in Eqs. (29) and (30) gives 

A 2 

{A ci x) = (A ef x) > — cosh?] 
ti 2 

(A ciP ) 2 = (A cfP ) 2 > — cosh t? 



6ap| Sp \<j) ap ) 



(29) 
(30) 



(31) 



whence 



A ei a; A ci p = A ef x A ci p > ^ cosh r\ 



which is the desired inequality. 

The product of errors achieves its lower bound if and only if the lower bounds 
set by Eq. (31) are achieved, so that |</> a p) is a minimum uncertainty state with 
respect to the pairs ex, —Sp and Sx, — ep. In the e"x, (^-representation 



(ex,<$x I ^ap) = 



exp 



1 



-^(e-"e x + Ie^ x ) 



Transforming to the exi,exf -representation we find 



(exi,e X f | </> ap ) 



1 



V^A 



exp 



2A 2 ( cosll ^ e xi - 2sinhr/exiexf + cosh?7e xf ) 
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Using Eq. (13) we obtain the probability distribution of measured values 

1 



_J_I 

h cosh rj J 



dxdp exp 



■py 



Wi, sy (X,p) 



This is a smeared Wigner function, of the kind proposed by Cartwright [35, 36, 46]. 
For i] = it reduces to the Husimi function. As r\ increases W syi i is smoothed 
over increasingly large regions of phase space — in agreement with the fact, that the 
larger 77, the less accurate the measurement. 

8. Conclusion 

In this paper we have only considered the Arthurs-Kelly process. It would be 
interesting to know whether the results obtained generalise to some of the other 
measurement processes which have been discussed in the literature [5, 6, 7, 10, 11]. 
We hope to return to this question in a future publication. 

Appendix: the Definition of the Retrodictive Errors 

In Eq. (6) we defined the rms errors of retrodiction A e [X, A e jp in terms of the 
retrodictive error operators exi, epi- The purpose of this appendix is to indicate 
the relationship between these quantities and the definition of the measurement 
inaccuracies which was given by Ali and Prugovecki [9, 18]. 

Consider a measurement process in which a system, with position x and momen- 
tum p interacts with an apparatus, characterised by two pointer observables /tx, 
/ip . Let ip and ap be the initial states of the system and apparatus respectively. 
Let U be the unitary evolution operator describing the interaction, so that the fi- 
nal state of the system+apparatus is U \ip <g> <p ap ) (in the Schrodinger picture). We 
no longer confine ourselves to the case of the Arthurs-Kelly process, so U is not 
assumed to have the particular form specified by Eq. (3). 



For a given choice of initial apparatus state </> ap we have 

(x,^x,Mp| ^|^O^ap) = J dx' K(x,nx,Lip]x r ) (x' \tp) 

for some kernel K (the choice of K being dependent on the choice of 
unitarity of U implies 



dxd/j, x dfJ,p K(x, fj, x , Mp; x i) k * ( x , Mx, Mp; x 2 ) = S(xi - x 2 ) 
The distribution of measured values is given by 



(32) 
6 ap ). The 

(33) 



Kmx,Mp) 



dx 



= J dxdx'idx' 2 K(x,HK,nv,x'i)K*(x,HK,iip;x'2) (x'i \ip)(i> \x' 2 ) 
or, in terms of the p-representation wavefunction, 

P(Mx,Mp) = j dpdp\dp' 2 k{p 1 ^ i iY>:p' l ) k*{p,^iiY>\p' 2 ) (p'i \^)(ip \p' 2 ) 



where 



K(p,fix,Hp;p') = ^2 / dxdx ' ex P 



-{p'x 1 -px) 



K(x, nx, fJ-p;x') 
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The marginal distributions can be written 

J dp P p(p x , pp) = J dx\ dx' 2 fx{px;x'i,x' 2 ) (x'i \ip)(ip \x' 2 ) 

J dp,x pif-x, Pp) = J dp' 1 dp , 2fp(fi,p;p' 1 ,p'2)(p'i\i>)(il>\p , 2) 



where 



/x(Mx; x'i, x' 2 ) = J dxd^p K(x, fix, Mp; x\) K*(x, /z x , Pp; x' 2 ) 

fp(fj-p;p'i,p'2) = J dpdpxK(p,px,pp;p'i)K*(p,px,pp;p'2) 



(34) 



Ali and Prugovecki [18] confine themselves to the class of processes for which / x 
and fp have the form 

fx(Px;x'i,x' 2 ) = Xx(Px,x\) S(x' 2 - x'i) 

/p(mp;p'i,p'2) = xp(vp,p'i)5{p'2 -p'i) 

for some pair of functions Xx, XPi and they focus on the even more restricted class 
for which 

fx{nx;x'i,x' 2 ) = XxoifJ-x - x\)5{x' 2 -x\) ^ 

/p(Mp;p'i,p'2) = Xpo(mp -p'i)S{p' 2 - p'i) 

for some pair of functions xxoj Xpo, This assumption is valid in the case of the 
Arthurs-Kelly process. However, there does not seem to be any reason to expect it 
to be true generally. 

If fx and fp do satisfy the condition of Eq. (35), the marginal probability dis- 
tributions for fix and fip can be written as convolutions of the initial system state 
probability distributions for x and p: 

J dp P p(p x ,pp) = j dx' XxoifJ-x -x') \(x' |V>)| 2 

J dp x p(px,pp) = j dp' xpo(hp ~p') \(p' \i>)\ 2 

Ali and Prugovecki [9, 18] define the measurement inaccuracies in terms of the 
widths of these convolutions: 

i 

2\ 2 

dxxxxo(x) 



cr x = dxx 2 Xxq{x) - {^J 
aF = {J dpp2)<P0 ^~ dpPXPo(p) 



(36) 

2\ 2 



Let us now compare these definitions with the definitions of the rms errors of 
retrodiction used in this paper. Referring back to Eq. (5) we see 

ap 

= (ip® ap | U^tAU + U ] PxUx + xll^fixU + x 2 \ip <g> ap ) 
Using Eqs (32) and (34) we find 

(V"2> <Aap| U^lAP \lp® </> ap ) = J dflxdflpdx fix (x,Px,Pp\ U |V"8>0ap) 

dtixdx' 1 dx'2Pxfx{px;x' 1 ,x , 2)(x'i \ tp){i>\ x' 2 ) 
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and 

(ip ® <j> ap \ U^fixUx \ip <g> 0ap) 

= J dfixdfipdx ^ x (o;,^x,Mp| ^ |V> ® ap ) (V> ® ap | |x,^ x ,Mp) 

= J duxdx'idx^Vxx'i fx(Hx\x\,x' 2 ) (x\ | V) (V>l ^'2) 

Similarly 

® ap | xU^HxU \ip ® </> ap ) 

= J dfj,xdx'idx'2fixx , 2fx(tJ'X\x'i,x'2) (x'i x' 2 ) 

Lastly 

(V"8)0ap|i 2 |V"X>0a P ) = / da;' a;' 2 | (a;' jV')) 2 



diixdx\dx'2x'xx'2fx{tJ>x;x\,rf2) (x'i I V) W>\ ^'2} 

where we have used the fact 

dfJ,xfx(vx;x'i,x'2) = 5(x\ - x'2) 



as follows from the unitarity condition, Eq. (33). Putting these results together 
gives 

(A oi x) 2 = J dpLxdx'xdx'2 (fix - x'i)(fix - x' 2 ) fx(fix;x'ux' 2 ) (x'i \tp)(tp \x' 2 ) 

In the same way we can derive 

(A ci p) 2 = J dfipdp x dp 2 (fi P - p\)(fip - p' 2 ) fp(fip;p'i,p' 2 ) (p'i \i>)(i> |p' 2 > 

These equations hold quite generally. If the functions fx, fp are of the form given 
by Eq. (35), then 

(A ci x) 2 = Jdxx 2 xxo(x) 

(A oi p) 2 = J dpp 2 xpo(p) 

Comparing these equations with Eq. (36) we see, that the only difference is, that Ali 
and Prugovecki work in terms of the standard deviations, whereas we use the rms 
values. Our definition essentially coincides with theirs, therefore, for the class of 
measurement processes which they consider (and which includes the Arthurs-Kelly 
process discussed in this paper). 

Our definition of the retrodictive errors applies to a larger class of measurement 
processes than the definition of Ali and Prugovecki. It also has other advantages, 
since it brings out the connection with the retrodictive error operators. In ref. [12] 
we used this connection to answer the question of principle raised by TJffink, and 
to prove the error-error and error-disturbance relationships of Eqs. (8-10). In the 
present paper we saw how the expansion in terms of eigenstates of the retrodictive 
and predictive error operators provides a very convenient way of analysing the 
mathematical properties of the Arthurs-Kelly process. 
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